Coping with disfluencies in spontaneous speech recognition
نویسندگان
چکیده
Nowadays, automatic speech recognizers have become quite good in recognizing well prepared fluent speech (e.g. news readings). However, the recognition of spontaneous speech is still problematic. Some important reasons for this are that spontaneous speech is usually less articulated and contains a lot of disfluencies. In this paper, a new methodology for coping with disfluencies is presented and evaluated. The basic idea is to detect disfluencies and to determine the nature of these disfluencies prior to the recognition, and to use that information to control/modify the search. At present, the methodology has been elaborated for filled pauses (FP) and word repetitions (WR). It enables us to eliminate about one associated normal word error per disfluency without introducing a significant augmentation of the computational load.
منابع مشابه
Coping with disfluencies in spontaneous speech recognition: Acoustic detection and linguistic context manipulation
Nowadays read speech recognition already works pretty well, but the recognition of spontaneous speech is much more problematic. There are plenty of reasons for this, and we hypothesize that one of them is the regular occurrence of disfluencies in spontaneous speech. Disfluencies disrupt the normal course of the sentence and when for instance word interruptions are concerned, they also give rise...
متن کاملBenefits of Disfluency Detection in Spontaneous Speech Recognition
Nowadays, automatic speech recognizers have become quite good in recognizing well prepared fluent speech (e.g. news readings). However, the recognition of spontaneous speech is still problematic. Some reasons for this are that spontaneous speech is usually less articulated and that it can contain a lot of disfluencies such as filled pauses (FPs), abbreviatons, repetitions, etc. In this paper, a...
متن کاملHandling Disfluencies in Spontaneous Language Models
In automatic speech recognition, a stochastic language model (LM) predicts the probability of the next word on the basis of previously recognized words. For the recognition of dictated speech this method works reasonably well since sentences are typically well-formed and reliable estimation of the probabilities is possible on the basis of large amounts of written text material. However, for spo...
متن کاملAutomatic Detection and Removal of Disfluencies from Spontaneous Speech
Unlike rehearsed and prepared speech, spontaneous speech contains high occurrence of disfluencies, like repetitions, filled pauses, and hesitations. Disfluencies can seriously hamper the word recognition accuracy of an Automatic Speech Recogniser (ASR), by increasing word insertion and deletion and rejection rates. In this paper we introduce signal processing algorithms to automatically identif...
متن کاملEvaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in Spanish
Spontaneous speech is full of acoustic disfluencies that rarely appear in read or laboratory speech. A very simple and straightforward approach is presented, in which acoustic disfluences are modelled by augmenting the inventory of sublexical units, which originally consisted of 23 context independent phones plus a special unit for silent pauses. This set was augmented with 12 additional units ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004